Klasifikasi Pertanyaan Bidang Akademik Berdasarkan 5W1H menggunakan K-Nearest Neighbors
نویسندگان
چکیده
Pertanyaan merupakan metode terbaik dan termudah untuk menggali sebuah informasi. Menurut aturan 5W1H, terdapat enam bentuk dasar pertanyaan yang dapat digunakan memperoleh informasi, yaitu: what, where, when, why, who, how. Banyak jurnalis menggunakan ini, karena diimplementasikan dengan cepat mudah membangun pertanyaan. Untuk membuat sistem memahami pertanyaan, misalnya seperti pada chatbot, khusus harus diterapkan membedakan keenam jenis ada. Penelitian ini mencoba melakukan klasifikasi terhadap dokumen berdasarkan tokenisasi stemming tahap pra-pemrosesan, kemudian K-Nearest Neighbors (K-NN) mengklasifikasikan Berdasarkan hasil pengujian, nilai akurasi tertinggi adalah 70.27% k = 5.
منابع مشابه
Search K Nearest Neighbors on Air
While the K-Nearest-Neighbor (KNN) problem is well studied in the traditional wired, disk-based client-server environment, it has not been tackled in a wireless broadcast environment. In this paper, the problem of organizing location dependent data and answering KNN queries on air are investigated. The linear property of wireless broadcast media and power conserving requirement of mobile device...
متن کاملk*-Nearest Neighbors: From Global to Local
The weighted k-nearest neighbors algorithm is one of the most fundamental nonparametric methods in pattern recognition and machine learning. The question of setting the optimal number of neighbors as well as the optimal weights has received much attention throughout the years, nevertheless this problem seems to have remained unsettled. In this paper we offer a simple approach to locally weighte...
متن کاملPredicting Medical Conditions Using k-Nearest Neighbors
As the healthcare industry becomes more reliant upon electronic records, the amount of medical data available for analysis increases exponentially. While this information contains valuable statistics, the sheer volume makes it difficult to analyze without efficient algorithms. By using machine learning to classify medical data, diagnoses can become more efficient, accurate, and accessible for t...
متن کاملOptical Character Recognition, Using K-Nearest Neighbors
The problem of optical character recognition, OCR, has been widely discussed in the literature. Having a hand-written text, the program aims at recognizing the text. Even though there are several approaches to this issue, it is still an open problem. In this paper we would like to propose an approach that uses K-nearest neighbors algorithm, and has the accuracy of more than 90%. The training an...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: JEPIN (Jurnal Edukasi dan Penelitian Informatika)
سال: 2021
ISSN: ['2548-9364', '2460-0741']
DOI: https://doi.org/10.26418/jp.v7i1.45322